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I. Introduction 

This paper describes the ratpoints program. This program tries to find all 
rational points within a given height bound on a hyperelliptic curve in the most 
efficient way possible. 



2. History and Acknowledgements 

This program goes back to an implementation of the 'quadratic sieving' idea by 
NOAM ELKIES that was around in the early 1990s. My own first contribution 
was to replace the char arrays that were used to store the sieving information by 
bit arrays, in 1995. Colin Stahlke then made use of the gmp library, so that 
points could be checked exactly, and implemented the selection of sieving primes 
according to their likely success rate, in 1998. After that, I successively put in 
numerous improvements (and some bug fixes). For details of how the program 
works, see Section [8] below. 

Along with Noam Elkies and Colin Stahlke, I would like to thank John 
Cremona and Sophie Labour for bug reports and suggestions for improve- 
ments. 



3. Availability 

The ratpoints-2 .1.1 package can be downloaded from my homepage, see [rat] . 

This program is free software: you can redistribute it and/or modify it under 
the terms of the GNU General Public License as published by the Free Software 
Foundation, either version 2 of the License, or (at your option) any later version. 

This program is distributed in the hope that it will be useful, but without any 
warranty; without even the implied warranty of merchantibility or fitness for 
a particular purpose. See the GNU General Public License for more details. 

You should receive a copy of the GNU General Public License and the GNU Lesser 
General Public License along with this program. 

If not, see http://www.gnu.org/licenses/. 



Date: April 15, 2009. 
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4. Installation 
This section describes the installation procedure under Linux. 

4.1. Extract the archive. 

> tar xzf ratpoints-2 . 1 . 1 . tar .gz 

This sets up a directory ratpoints-2 .1.1 containing the various files that belong 
to the installation. 

4.2. Build the program and library. 

Do 

> cd ratpoints-2 . 1 . 1 

> make all 

This will build the library libratpoints . a and the executable ratpoints. 

By default, the program will use primes up to 127. If you intend to do compu- 
tations with large height bounds or with curves that you expect to have many 
points, it may make sense to increase the range of primes. This can be achieved 
via 

> make all PRIME_SIZE=8 

perhaps after a 

> make distclean 

to remove the files that were generated previously. The PRIME_SIZE argument can 
be given any value from 5 to 10; otherwise it is taken to be 7. The precise meaning 
is that the program will work with primes < 2 s , when PRIME_SIZE = s. 

The program now uses SSE instructions if available. If you don't want this, do 

> make distclean 

> make all CCFLAGS=-UUSE_SSE 

4.3. Run a test. 
Run 

> make test 

in the working directory. This will build an executable rptest and then run (and 
time) it. Finally, the output (which was written to a file rptest . out) is compared 
against testbase, which contains the output of a sample run. The two should be 
identical. 
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4.4. Install. 

If you like, you can install the library, executable and header file on your system. 

> su 

> make install 

The executable is copied to /usr/local/bin/, the library to /usr/local/lib/, 
and the header file to /usr/local/include/. You can change the /usr/local 
prefix by giving the option INSTALL_DIR= . . . to make install. 

4.5. Debugging. 

In case you found a bug and would like to find out where it comes from, or if you 
just want to see exactly what the program is doing, you can do 

> make debug 

in the working directory. This will build an executable ratpoints-debug, which, 
when run, will dump loads of output on the screen, so it is best to send the output 
to a file, which you can then study at leisure. 

4.6. Cleaning up. 

In order to get rid of the temporary files, do 

> make clean 

To remove everything except the files from the archive, use 

> make distclean 

4.7. Requirements. 

You need a C compiler (like gcc, which is the default specified for the CC variable 
in Makefile; if necessary, it can be changed there). 

In addition to the standard libraries, the program also requires the gmp (GNU 
multi-precision) library jgmpj . 

4.8. List of files. 

The archive contains the following files. 

• Makefile 

• ratpoints .h — the header file for programs using ratpoints 

• rp-private .h, primes. h — header files used internally 

• gen_f ind_points_h . c , gen_init_sieve_h . c — short programs that write 
additional header files depending on the system configuration 

• sift.c, init.c, sturm.c, f ind_points . c — the source code for the 
ratpoints library 

• main.c, rptest.c — the source code for the ratpoints and rptest ex- 
ecutables, respectively 

• testdata.h, testbase — data for the test run 
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• ratpoints-doc.pdf — this documentation file 

• gpl-2.0.txt — the GNU license that applies to this program. 

5. HOW TO USE ratpoints 

5.1. Basic operation. 

Let 

C : y 2 = a n x n + a n -\x n ~ + • • • + a±x + ao 
be your curve, and let H be the bound for the denominator and absolute value of 
the numerator of the x-coordinate of the points you want to find. The command 
to search for the points is then 

> ratpoints 'a ai . . . a n _i a n ' H 

The first argument to ratpoints is the list of the coefficients, which have to be 
integers, are separated by spaces, and are listed starting with the constant term. 
There is no bound on the size of the coefficients; they are read as multi-precision 
integers. 

The degree n of the polynomial is limited to RATPOINTS_MAX_DEGREE (which is 
defined in ratpoints .h), which is set to 10 by default. 

The program will give an error message when the polynomial (considered as a 
binary form of the smallest even degree > n) is not squarefree (e.g., if you specify 
two leading zero coefficients). 

The following subsections discuss how to modify the standard behavior. This is 
achieved by adding options after the two required arguments. Some of the options 
have arguments, others don't; the ordering of the options and option- argument 
pairs is arbitrary (except for options with opposite effect, where the last one given 
counts, and for the specification of the search intervals, which must be in order). 

5.2. Early abort. 

By default, ratpoints will search the whole region that you specify and print all 
the points it finds. If you just want to find one point (for example when dealing 
with 2-covering curves of elliptic curves), you can tell the program to quit after it 
has found one point via the -1 option, e.g., 

> ratpoints '-18 116 48 -12 30' 60000000 -1 

If you don't want to see points at infinity, specify the -i option. The two options 
can be combined: -i -1 will stop the program after it has found one finite point. 

5.3. Changing the output. 

By default, ratpoints prints some general information before and after the points, 
which are given in the form (x : y : z) , one per line. Here, (x : y : z) are 
the coordinates of the point considered as a point in the (1, \n/2\ , l)-weighted 
projective plane, which is the natural ambient space for the curve. The coordinates 
are integers with x and z coprime and z > 0, or z = and x = 1 (for points at 
infinity) . 
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There are various ways to change this behavior. 

• To suppress all output except the points, use -q (for quiet). 

• To add some more output explaining what the program is doing, use -v 
(for verbose). This has no effect if the -q option is also given. 

• To suppress printing of the points, use -z. 

• If you only want to list the ^-coordinates of point pairs rather than indi- 
vidual points, use -y. 

• There are four options that influence the format in which the points are 
printed: 

-f format -f s string-before -fm string-between -f e string-after 
The arguments to all of them are strings; "\n", "\t", "\\" and "\%" arc 
recognized and do what you expect. The format string can contain markers 
%x, °/„y, y„z that will be replaced with the x, y, and z-coordinate of the point, 
respectively. The defaults are empty for string-before, string-between and 
string-after, and " (°/ x : %y : °/ z)\n" for format when -y is not specified; 
otherwise " (%x : %z)\n". 

The effect is as follows. Before any point is printed, string-before is output. 
Then every point is printed according to format. Between any two points, 
string-between is output, and after the last point, string- after is output. 

As an example, consider the effect of 

-f "[•/.x.'/.y.'/.z]" -fs "{" -fm \" -fe "}\n" 

5.4. Restricting the search domain. 

There are two ways to restrict the domain of the search. The first is to restrict 
the range of denominators considered. This is done via 
-dl d min -du d max . 

For example, to only look for integral points, you can say -du 1. By default, the 
lower limit is 1 and the upper limit is H . 

The other way is to restrict the search to a union of (closed) intervals. If you only 
want to search for points in 

[h,ui] U [l 2 , u 2 \ U • • • U [l k , u k \ , 

you can specify this in the form 

-1 li -u Ui -1 l 2 -u u 2 ■ ■ ■ -1 h ~U U k ■ 
The first -1 option is optional; li defaults to — oo. Similarly the last -u option 
is optional, and u k defaults to +oo. The number k of intervals is bounded by 
RATP 1 NTS _MAX_DEGREE (usually, 10). 

5.5. Setting the parameters for the sieve. 

It is possible to change the number of primes that are used in the various stages 
of the algorithm. This is done by the following options. 
-pM-NiV-nii 
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This sets the number of (odd) primes that are considered for the sieve to M, the 
number of primes that are actually used for the sieve to N, and the number of 
primes used in the first sieving stage to n. The program will, if necessary, reduce 
M, N, and n (in this order) to ensure that n < N < M < RATPOINTS_NUM_PRIMES. 
The latter is usually 30 (the number of odd primes < 2 7 ), but will be 53 (the 
number of odd primes < 2 8 ) if PRIME_SIZE is set to 8, etc. 

There are two more parameters that can be set. 
-F D 

sets the maximal number of 'forbidden divisors'. If the degree is even and the 
leading coefficient is not a square, then the denominator of any x-coordinate of a 
rational point cannot be divisible by a prime p such that the leading coefficient 
is a non-square mod p. If the leading coefficient is divisible by p, but not a 
p-adic square, the denominator cannot be divisible by some power of p. The 
program constructs a list of such forbidden divisors against which to check the 
denominators, and this option specifies how many of these should be used. 

The other option is 
-S S or -s . 

In the first form, it sets the number of refinement (interval halving) steps in the 
isolation of the real components to S. This part of the program computes a Sturm 
sequence for the polynomial and uses it in order to find a union of intervals that 
contains the intervals of positivity of the polynomial. This is done by a successive 
subdivision of the interval ]— oo, +oo[. The number S sets the recursion depth. S 
can be omitted; then it is given a default value. The option -s skips the Sturm 
sequence computation completely. This also has the effect of removing most of 
the check for squarefreeness. 

5.6. Switching off optimizations. 

The options -k and -j can be used to prevent the program from reversing the 
polynomial, which it usually does if this will lead to faster operation (-k), and to 
prevent the program from using the Jacobi symbol test on the denominators ( _ j). 
This last test extends the 'forbidden divisors' method described in Subsection 15. 51 
above by computing the Jacobi symbol (l/d), where I is the leading coefficient 
and d is the odd and coprime-to-/ part of the denominator. If the symbol is —1, 
the denominator need not be considered. The use of these options may be ques- 
tionable, unless you want to see how much performance is gained by using these 
optimizations. 

5.7. Switching off exact testing of points. 

The option -x will prevent the program from checking the potential points that 
survive the sieve whether they really give rise to rational points. This implies that 
the points that are output may not actually be points. It also effectively sets the 
-y option, since the y-coordinates are not computed. 
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5.8. Overriding previous options. 

The options -I, -Y, -Z, -S, -K, -J, -X can be used to cancel the effect of the 
corresponding lower-case option (and thereby restore the default behavior), when 
this occurs earlier in the list of options. This may be useful if you want to set a 
default behavior that is different from what the program does out-of-the-box (e.g., 
in a shell script), but want the caller to be able to override this change. 

6. HOW TO USE THE LIBRARY 

It is possible to use the ratpoints machinery from within your own programs. The 
library libratpoints . a provides the following functions. 

long f ind_points (ratpoints_args* , 

int procQong, long, const mpz_t, void*, int*), 
void*) ; 

void f ind_points_init (ratpoints_args*) ; 

long f ind_points_work(ratpoints_args* , 

int procQong, long, const mpz_t, void*, int*), 
void*) ; 

void f ind_points_clear (ratpoints_args*) ; 

The passing of arguments to these functions is via the ratpoints_args structure, 
which is defined as follows. 

typedef struct { mpz_t *cof ; long degree; long height; 

ratpoints_interval *domain; long num_inter; 
long b_low; long b_high; long spl; long sp2; 
long array_size; 

long sturm; long num_primes; long max_f orbidden; 
unsigned int flags; ...} 
ratpoints_args ; 

The dots at the end stand for additional fields that are used internally and are not 
of interest here. When calling f ind_points, the cof field must point to an array 
of size degree + 1 of properly initialized gmp integers; these are the coefficients 
of the polynomial. The program can alter the values of the integers in this array. 
To prevent this, set the RATPOINTS_NO_REVERSE bit in flags; this may result in a 
loss of performance, though. 

height gives the height bound. It is an error for the degree or the height bound to 
be nonpositive; in this case the function returns the value RATPOINTS_BAD_ARGS. 

The field domain must contain a pointer to an array of ratpoints_interval 
structures of length at least num_inter plus degree. This array gives (in its first 
num_inter entries) the intervals for the search region. The type ratpoints_interval 
is just 
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typedef struct {double low; double up;} ratpoints_interval ; 

the meaning of this should be clear. In ratpoints, this is set via the -1 and -u 
options. Usually, you do not want to restrict the range of x-coordinates; then you 
set num_inter = (but you still have to fill domain with a valid pointer to at 
least degree intervals, unless you set sturm to a negative value!) The program 
may alter the values in the array domain points to, unless sturm has a negative 
value (which may lead to a performance loss). 

b_low and bJiigh carry the lower and upper bounds for the denominator; if non- 
positive, they are set to and the height bound, respectively. In ratpoints, these 
fields are set by the -dl and -du options. 

spl and sp2 specify the number of primes to be used in the first sieving stage and 
in both sieving stages together, respectively. If negative, they are set to certain 
default values. In ratpoints, these fields are set by the -n and -N options. 
Similar statements are true for num_primes (option -p), sturm (options -s, -S) 
and max_f orbidden (option -F). The field array_size specifies the maximal size 
(in long words) of the array that is used in the first sieving stage. If non-positive, 
it is set to a default values. The various default values are defined at the beginning 
of the header file ratpoints. h, where also the maximal degree of the polynomial 
is set. 

The flags field holds a number of bit flags. 

• RATPOINTS_NO_CHECK — when set, do not check whether the surviving re- 
coordinates give rise to rational points (set by the -x option to ratpoints). 

• RATP0INTS_N0_Y — only list ^-coordinates (in the form (x : z) e P^Q)) 
instead of actual points (with a y-coordinate); this is set by the -y option 
to ratpoints. 

• RATPOINTS_NO_REVERSE — when set, do not allow reversal of the polyno- 
mial (set by the -k option to ratpoints). 

• RATP0INTS_N0_JAC0BI — when set, prevent the use of the Jacobi symbol 
test (set by the -j option to ratpoints). 

• RATPOINTS-VERBOSE — when set, causes the procedure to print some out- 
put on what it is doing (set by the -v option to ratpoints). 

There are some other flags that are used internally. One of them might be of 
interest: 

• RATPOINTS_REVERSED — when set after the function call, this indicates 
that the polynomial has been reversed (and the contents of the cof array 
have been modified). 

The main vehicle for passing information back to the caller is the proc function 
argument together with the pointer info. This function 

int proc (long x, long z, const mpz_t y, void *info, int *quit) 
is called whenever a point was found, x, y and z are the coordinates of the point 
(where y is a gmp integer), info is the pointer that was passed to f ind_points; 
this can be used to store information that should persist between calls to proc. If 
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*quit is set to a non-zero value, this indicates that f incLpoints should abort the 
point search and return immediately; otherwise the search continues. The return 
value is taken as a weight for counting the points; usually it will be 1. 

The usual framework for using f incLpoints is as follows. 
(...) 

#include "ratpoints .h" 
(...) 

mpz_t c [RATP0INTS_MAX_DEGREE+1] ; /* The coefficients of f */ 
ratpoints.interval domain [2*RATP0INTS_MAX_DEGREE] ; 

/* This contains the intervals representing the 
search region */ 

* function that processes the points * 
typedef struct {...} data; 

int processQong x, long z, const mpz_t y, void *infoO, int *quit) 
{ data *info = (data *)infoO; 

(...) 

return (1) ; 

} 

* main * 

int main (int argc, char *argv[]) 
{ 

long total, n; 
ratpoints_args args; 

long degree = 6; 

long height = 16383; 

long sieve.primesl = RATP0INTS_DEFAULT_SP1 ; 

long sieve_primes2 = RATP0INTS_DEFAULT_SP2 ; 

long num.primes = RATPOINTS_DEFAULT_NUM_PRIMES ; 

long max.forbidden = RATPOINTS_DEFAULT_MAX_FORBIDDEN; 

long b_low = 1; 

long b_high = height; 
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long sturm_iter 
long array_size 
int no_check 
int no_y 
int no_reverse 
int no_jacobi 
int no_output 



= RATPOINTS_DEFAULT_STURM ; 
= RATPOINTS_ARRAY_SIZE ; 
= 0; 
= 
= 
= 
= 



unsigned int flags = 0; 

data *info = malloc(sizeof (data) ) ; 

/* initialise multi-precision integer variables */ 
for(n =0; n <= degree; n++) { mpz_init (c [n] ) ; } 

(. . .) 

{ /* set up polynomial */ 
long k; 

for(k = 0; k < 7; k++) { mpz_set_si (c [k] , ...); } 



args . cof 
args . degree 
args .height 
args . domain 
args . num_ inter 
args .b_low 
args . b_high 
args . spl 
args . sp2 
args . array_size 
args . sturm 
args . num_primes 
args .max_f orbidden 
args .flags 



&c[0] ; 
6; 

height ; 
&domain[0] ; 
0; 

b_low; 
b.high; 

sieve_primesl ; 
sieve_primes2; 
array_size ; 
sturm_iter ; 
num_primes ; 
max_f orbidden; 
flags; 



inf o-> ... = . . . ; 
(...) 

total = f ind_points (feargs , process, (void *)info); 
if (total == RATP0INTS_N0N_SQUAREFREE) 
{ ... } 

if (total == RATP0INTS_BAD_ARGS) 
{ ... > 
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(...) 

} 

/* clean up multi-precision integer variables */ 
for(n =0; n <= degree; n++) {mpz_clear (c [n] ) ; } 

return (0) ; 

} 

If points are to be searched on many curves, then it is slightly more efficient to 
use the sequence 

f ind_points_init (feargs) ; 

for( ... ) 
{ ... 

total = f ind_points_work(&args , process, (void *)info); 

} 

f ind_points_clear (feargs) ; 
This avoids the repeated allocation and freeing of memory. 

For practical examples, see main.c (the code that wraps find_points for the 
command line program ratpoints) or rptest.c (which runs find_points on 

some test data). 

7. FINE-TUNING THE PARAMETERS 

For large computations, it may be a good idea to try to find the best (or at least, a 
good) combination of parameters for the given kind of data. The default values are 
chosen for optimal performance on an Intel (R) Core(TM)2 CPU T7200@2 . 00GHz, 
for random genus 2 curves with small coefficients and a height bound of 2 14 — 1. 
For your machine and input data, other values may be better. You can replace the 
testdata.h file with your own collection of test data (and edit rptest . c if neces- 
sary to adapt the degree and height settings) and then time ./rptest -z (the -z 
option suppresses the output) for various combinations of the parameter settings, 
rptest accepts most of the optional parameters of ratpoints, in particular the 
-p, -N, -n, -F and -S parameters, so you can easily change the parameters used 
on the command line. In addition, there is an option -h H to change the default 
height bound. 

Once you have found a good set of parameter values for your application, you 
can hard-code them as defaults into ratpoints by changing the definitions in 
ratpoints. h (and then make distclean all test), or you can use them to fill 
the ratpoints_args structure for your call to f ind_points. 
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As a number to compare against, on my laptop with the CPU above, make test 
takes about 6.6 seconds. To put this into perspective, note that this means that 
some 24 points are tested on average per CPU cycle. 

8. Implementation 

8.1. Overview. 

Let F(x, z) be the binary form of even degree corresponding to the polynomial on 
the right hand side of the curve equation. The basic idea is to let run b from 1 to 
the height bound H, for each b, let a run from —H to H, and for each coprime 
pair (a, b) check if F(a, b) is a square. 

Of course, in this form, this would take a very long time. To speed up the process, 
we try to eliminate quickly as many pairs (a, b) as possible before the actual test. 
This can be done by 'quadratic sieving' modulo several primes: if F(a, b) is a 
square, it certainly has to be a square mod p, and so we can rule out all (a, b) that 
do not satisfy this condition. Another ingredient is to represent (for a fixed b) 
the various values of a by bits and treat all the bits in a long word (32 or 64, as 
the case may be, or 128 when SSE instructions are used) in parallel. For this, we 
organize the sieving information for each prime p into p arrays of p words each, 
one such array for every b mod p, such that the jth bit in this array is set if and 
only if F(j, b) is a square mod p. 

For each 'denominator' b, we then set up an array of words whose bits represent 
the range —H < a < H (aligned so that a = corresponds to the Oth bit of a 
word); the bits are initially set. Then for each of the sieving primes p, we perform 
a bit-wise and operation between this array and the sieving information at p. In 
a first stage, this is done on the whole array; after this first stage, each remaining 
('surviving') non-zero word in the array is subject to tests with more primes. If 
some bits are still set after this second stage of sieving, the corresponding pairs 
(a, b) (if coprime) are then checked exactly. 

In the following subsections, we discuss a number of improvements that were made. 

8.2. Sorting the primes. 

This idea is due to Colin Stahlke. We do not just take the first so many primes 
in increasing order for the sieving, but we first compute the number of points the 
curve has mod p for a number of primes p and then sort the primes according to 
the fraction of x-coordinates that give points. We then take those primes for the 
sieving that have the smallest fraction of 'surviving' ^-coordinates. In this way, 
we need fewer sieving primes to achieve a comparable reduction of point tests. 

8.3. Using connected components. 

We note that we can only have points when F(a, b) is no n- negative. If we can 
determine intervals on which f(x) = F(x, 1) is negative, then we do not have to 
look for points in these intervals. The necessary computations can be performed 
exactly, by computing a Sturm sequence for / and counting the number of sign 
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changes at various points, see [Coh] Thm. 4.1.10]. This can in particular tell us 
whether F is negative definite, in which case we have already proved that there 
are no rational points in the curve. If there are real zeros, we use a subdivision 
method in order to find a collection of intervals containing the projections of the 
connected components of the curve over M. 

8.4. Using 2-adic information. 

John Cremona suggested that in some cases, one can determine beforehand that 
all points will have odd 'numerators' a, and so we can pack the bits more tightly 
by only representing odd numbers. This approach can be extended. We first find 
all solutions mod 16 (higher powers of 2 would be possible, but not very likely to 
give an improvement). Then for every residue class mod 16 of the denominator b, 
we can find the residue classes mod 16 of potential numerators a. If there are 
none, then we can eliminate b as a denominator altogether. If all potential a's are 
even or odd (this will always be the case for even denominators), we can restrict 
the sieving to such numerators. 

8.5. Elimination of denominators. 

Depending on the equation, certain denominators can be excluded. We have seen 
an instance of this in the previous subsection, but we can also work with odd 
primes. 

8.5.1. Odd degree and ±monic. If the polynomial has odd degree and leading 
coefficient ±1, then the denominator has to be a square. This reduces the time 
complexity tremendously. 

8.5.2. Odd degree general. In general, when / has odd degree, the denominator 
has to be 'almost' a square: it must be a square times a (squarefree) divisor of the 
leading coefficient, and there are further restrictions on the parity of the valuation 
at p, for p dividing the leading coefficient, when this valuation is sufficiently large. 
This gives the same type of time complexity as in the monic case, but with a larger 
constant. 

8.5.3. Even degree with non-square leading coefficient. Here we can exclude de- 
nominators divisible by a prime p such that the leading coefficient is a non-square 
mod p. We can also in some cases rule out denominators divisible by a certain 
power of p when p is a prime that divides the leading coefficient (if the leading 
coefficient is not a p-adic square). While computing the sieving information, we 
make a list of such primes and prime powers, which we then use later to eliminate 
denominators. In addition, we use the necessary condition that the Jacobi symbol 
(p-) must be +1, where I is the leading coefficient and b' is the part of b coprime 
to 21. 
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8.6. Reversing the polynomial. 

The exclusion of denominators described above is more or less effective, depending 
on the situation; it is the better the earlier the case was described. Since the height 
of the points does not change under the transformation (x, y) — > (l/x,y/x^ n / 2 ^), 
we can as well search on the reversed polynomial F(z,x). This will result in a 
speed-up if the reversed polynomial belongs to a 'better' class than the original. 

8.7. Some general remarks. 

Modern processors are very fast when doing basic things like moving data around 
between registers or simple arithmetic operations (addition, subtraction, shift, . . . , 
comparison). Even memory access can be fast when there is no cache miss. On 
the other hand, integer division is a slow process. It turned out that quite some 
improvement of the performance was possible by removing as many instances of 
integer division operations as reasonably possible. 

For example, we use gcd and Jacobi symbol routines that rely (almost) entirely 
on differences and shifts. We compute the residue classes of b modulo the various 
sieving primes by addition of the difference from the last b and then correcting 
by subtracting p a number of times if necessary, and we implement most of the 
testing of 'forbidden divisors' of b using bit arrays similar to those used for sieving 
the numerators. 

8.8. Change log. 

Version 2.0 was released January 9, 2008. 

Version 2.0.1 was released July 7, 2008. It fixes a bug that prevented the '-1' 
option to work properly. 

Version 2.1 was released March 9, 2009. It makes use of the SSE instructions, 
so that the sieve can work on 128 bits in parallel (instead of on 64 or 32). On my 
laptop (with an Intel Core2 processor), make test runs about 25% faster than 
before. 

Version 2.1.1 was released April 14, 2009. It fixes a bug that in some cases 
prevented an early abort when *quit was set by the callback function. Thanks to 
Robert Miller for the bug report and the fix. In addition, it is now checked that 

W0RDSIZE == 64 before SSE instructions are used (in rp-private .h), since 

the program then assumes that a bit array consists of two unsigned longs. In 

this context, SSE2 has been replaced by a new macro USE.SSE. A further fix 

eliminates unncessesary copying of sieving information when SSE instructions are 
not used (introduced in version 2.1). This was almost always harmless, but could 
have resulted in memory corruption in the extreme case that all the information 
had to be computed for all the primes. 
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